Analyzing Imputed Survey Data Sets with Model Assisted Estimators

نویسنده

  • Robert E. Fay
چکیده

Multiple imputation has been previously applied for mass imputation, that is, the imputation from a subsample with complete information to a larger sample. In such applications, the missing data rates are often substantial, such as 80 percent or more, but valid inferences should, in principle, be within reach when probability sampling is used. Yet, limitations of multiple imputation can become severely amplified in this setting (Fay 1994). This article combines an alternative strategy to multiple imputation, called fractionally weighted imputation, with a model assisted approach to estimation. This combination affords several advantages: 1) more efficient model based estimates when the model is true, 2) consistent estimates of the variances for the model based estimates, 3) resistance of the model assisted estimates to model failure, 4) consistence estimates of the variance of the model assisted estimates, 5) estimates of bias for the model based estimates, and 6) consistent estimates of the variance of the bias estimates. With this array of information, analysts will be in an improved position to analyze imputed data sets effectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

متن کامل

Inference with Survey Data Imputed by Hot Deck When Nonrespondents Are Nonidentifiable

Hot deck imputation for nonrespondents is often used in surveys. It is a common practice to treat the imputed values as if they are true values, and compute survey estimators and their variance estimators using standard formulas. The variance estimators, however, have seriously negative biases when the rate of nonresponse is appreciable. Methods such as the multiple imputation and the adjusted ...

متن کامل

Pseudo-GEE Approach to Analyzing Longitudinal Surveys under Imputation for Missing Responses

This paper presents a pseudo-GEE approach to the analysis of longitudinal surveys when the response variable contains missing values. A cycle-specific marginal hotdeck imputation method is proposed to fill in the missing responses and the pseudo-GEE method described in Carrillo et al. (2009) is applied to the imputed data set. Consistency of the resulting pseudo-GEE estimators is established un...

متن کامل

Variance Estimation for Imputed Survey Data with Non-negligible Sampling Fractions

We consider variance estimation for Horvitz-Thompson type estimated totals based on survey data with imputed nonrespondents and with non-negligible sampling fractions. A method based on a variance decomposition is proposed. Our method can be applied to complicated situations where a composite of some deterministic and/or random imputation methods is used, including using imputed data to impute....

متن کامل

Bayesian paradigm for analysing count data in longitudina studies using Poisson-generalized log-gamma model

In analyzing longitudinal data with counted responses, normal distribution is usually used for distribution of the random efffects. However, in some applications random effects may not be normally distributed. Misspecification of this distribution may cause reduction of efficiency of estimators. In this paper, a generalized log-gamma distribution is used for the random effects which includes th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002